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Introductory Statement 

The Center for S^C-ial Organization o5 Schools has o primary 
objectives: to develop a scientific knowLsilge of how schQ)o is affect theitr 
students, and to use thus knowledge to dgcpginp better scb practi^ -^-^ 
and organization. ♦ ^ 

The Center works rhrrough four progrsins to achieve ls ob jectdves . 
The Policy Studies in Sc±ool Desegregatior prrDgram applies the basic 
theories of social organization of schools ro study the .vatenml conditicKs 
of desegregated schools, rhe feasibility of alternative <fesegregati> on 
policies, and the interr<r:!.ation of school desegregation^^ ^ OL±eT eqoi . 5 
issues such as housing acrd job desegregation. The S choc t i/rimm, 7.^ t ion 
program is currently coiLLierned with authority-control st *-iictiiiH=:s , ta. • 
structures, reward systems, and peer group processes in schooi.s. it: 114 s 
produced a large-^cal^ stiucy of the effects of open schools, "^s Joevelo eid 
Student Team Leapr*n.ng Instructional processes for teaching va-^ois ^asr\,^^'^xs 
in elementary ant) seccoiiary sciiools, and has produced a compu::^:srjireid 
system for school wide «ctendance^ monitoring. The Scrrool Pror"*''^^ and 
Career Development proj^p.ram is studying transitions from high scncsdl to 
post secondary ins -itur ions and the role of schooling in the develo- jier.r 
of career plans si:^ the actixalization of labor market outcomes. Ifee 
Studies in Del'^'^mw^^cy aad School Environments program is exanu.niag' tht 
interaction of snirrool eosrironments , school experiences, aud iTidii^±dual 
characteristics zr relarr on to in-school and later-life delinquency 

This report, prepared by the Policy Studies in School Desonegat £01 
program, furthers the scijtsntif ic measurement of racial and etnnic ^ 
gation by cbmparing the .irrf lity of two measures-"the Index of Dis^umi^,. rity 
and the Segregation Ind*>x. 
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Abfft Vffcrt 

Most measuremeatis osf racial and ethnic segregations—particularly 
comparative analyses Mci^s^ diies—h^e relied on the Index of 
Dissimilarity ("D")* Reamtly, howeveii, it has been demonstrated 
that cities with different racial compositions also have different 
expected values of 0^ under a random distribution of whites and blacks. 

Here we compare with another, increasingly used, segregation 
index which we call Applied to njeasurement of school segregation, 

S equals the proportional unde rre pre sent at ion of black students in the 
school artended by the average white student in the district. A 
priitcipa: va lue of _S is that its expected value is independent of racial 
composit:.OT and rapicLy approaches zero, even for modest-sized units of 
analysis. Also, S can be "decomposed" into segregation within subsets 
of entitifi=^ and segregation between subsets. 

The b<ehavior of S and D are explored with data on segregation in 
higher edn.-ation and segregation of workers across different places of 
employment- ^ 



iil 



Acknowl edgment s 

This paper was written with the assistance of James M. McPartland, 
Gary A. Chase, Denise Daiger and Gail E. Thomas. 



I 



iv 

o 

ERIC 



Mean -^ifis f ^Segregation" 

The' t(.^--??T3i "ssgregation" has a variety * spe.ii:^ mearri-^^ both in 
evsrp7tl«y '''^ctrmrse and in social scientific -^nc po-i.ic^ rese cor. Most 
otiin , it is nised to descijibe a single establi:. menr cr env • 'orr"^ \c (that 
is A '^cheic. 3 workplace, a neighborhoad) , 

?:.r exannple, a place may be considered ^regatec if ^sessr res of a 
30C^ y salient category constitute either a: erwhelit ngl^ ii-ax^ 
pre ; or the members or inhabitants (e.g. 957o b.^^ac^) c en 
exc 'iajLii>^ -7 ^mall piroportion (e.g., 5% black). The s <c? fic :eat:ure 
of sw'cfo a pla vje is that both the experience of ^ ing un hcmyg^fneous 
s£. . lor^i amd the experience of life as part of a ti!r>' *'ni inr: rJiyy* have 
iEiportamit co>iisequences for individual attitudes— soci.'^ ^ctic- ir 
Si: rue: ur^" events. 

notber use of the term "segregation" drsjs attrnt on t a specific 
p 'jf de^viates markedly from the averagr ^noup nembershiir composition 

atfc- of places in a certain universe. For unple, place 

iB^y j.iTisitiEred segregated if its racial ccanD- Itio is 507o olack and 
5' e b«^cause the places in this particu- iniver3ft average only 

2 ack and 80% white. Here the significar - atur . is t>.e inequality 

c p aces in the experience of group hetarpfe^^eit /. The "segregated" 

: . are those which most deviate from the ^^r^ age experience In the 
y: rse of places. 

In contrast to both of these uses of thi t*rrm "segregation," the 
«r7rc 1 s also applied to a collection or aggreg^i :e o collective units 
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such as a school district, r- :;ity divided into census tracts, or al" 
the places of businesses vithftr a particular industry. 9nsn so used». the 
emphasis is usually on the ^ee of variation among th^^- units 
within nhe aggregate. For example, the schools in cin' may be 
considered morr segregated tr^ a city B's if there is het variation 
the "percent black" among the schools in city A rt-aan sameng the schoo£ii: 
in city B. City A may have c more ethnically het^e; ^Reeneaus total 
enrollment than city B> but r±e distribution of ^ • Pi ,d from the various 
ethnic groups across the schools in city A is 1^ ifarm. 

Generally, the use of the term "segregati— i\ social 
scientific research has been closer to this la de . .ition than it ha5 
been to the others. For more than a decade, a ^ingit measure of 
segregation, the "dissimilarity index," has dcr aate the research 
literature on t:egregation. 

The Index of Dissimilarity 

The dissimilarity index, D, is based on :rzie unevenness of the 
univariate distributions of rwo groups (e.g., blacks and non-blacks) 
among units such as schools cr places of business. With the dissimilarity 
index, the measurement of unevenness is represented by a Lorenz curve. 

To understand D and the Lorenz curve", consider a set of mutually 
exclusive collective units like schools or places of business. First, 
array each of these units in ascending order according to the proportions 
of their members (students, employees) who are in a particular group 
(e.g., their percent black). A Lorenz curve, in this case, is 



given by g iTg )hing the cumulative proportion of all nom-blacks in the 
aggregate wmo belong to those units considered up to that point against 
the cumulative proportion of all blacks in those same units. (Figure 1. 
illustrates :±is curve,) In a completely integrated aggregate, if a 
unit conta:::^.::ad 10% of all the people in category X, iX would also 
ccxitain lOT all of the category Y's (non-X's), Undey; segregation, 
units that H£re "overrepresented'' among one group's popolation are 
necessarily underrepresented among the residual group. 

The value of the dissimilarity index is^given by the vertical 
distance becween the Lorenz curve and the point along the 45° diagonal 
(representing a completely integrated aggregate) at the point where the 
curve is tangent to a line parallel to the 45° line (see Figure 1). 
This verti;:al distance represents the accumulated underre presentation 
of one group's population up to the point where the next unit added 
is the one which has a proportionate allocation of members to the two 
groups considered. Thus, the dissimilarity index is the minimum 
proportion of members of a given group, presently located in units "under 
represented" in that group, who would have to move or be transferred to 
units where such persons are overrepresented in order for each unit to 
contain the identical fraction of oach groups' population. 

Where one is dealing with two exhaustive groups (such as blacks 

and non-blacks), D is given by: / 

^ n^ j p^ - P I where n^ = total persons, i^th unit; 

D = — 2 N * P (1-P) ' ' " percent black, i'th unit; 

P = overall percent black; 

N = total persons 
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Because D is based on a single point in the bivariate cumu^ ive distribution 
(the point of proportionate representation among the orde*ri=ii amits) , it 
does not distinguish between various degrees of segregati.ar. aaiang the 
"overrepresented" or among the "underrepresenCed" units. sn Figure 1, the 
dashed line represents a different universe where no one Is iotmd in an 
e;vctxrslvely black or non-black ujitj^ but which has the scame D score as the 
universe shown by the solid line. 

The concept of dissimilarity and its measurement sre not limited 
to situations of exhaustive dichotomies. The same noticm of under- and 
overrepresentation can be applied to any two mutually exclusive groups 
(e.g., Hispanics and blacks). Under these circumstances D has the 
following algebraic representation: 



I wne r 

H.B^ = corresponding sums 4^ H. , ? B... 



1 /o Z-j _i _Jl1 where H. = # Hispanics, i'th census tract; 
" i I H " B r b!" = # blacks, i'th census tract; 



The dissimilarity index has been in use for over twenty years 
(e.g., Taeuber and Taeuber, 1965; Marshall and Jiobu, 1975; Simkus, 
1978). It has a number of important and useful properties: it ranges 
in value from 0 (no segregation) to 1 (complete segregation); its 
value can be calculated independently of the overall composition of 
different groups in the aggregate ds a whole (e.g., overall percent 
black); it is applicable to situations where two groups are not 
exhaustive of the population; its value is easy to calculate, and it 
has an €\^sily interpre table verbal description as well as a graphical 
one. 
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Expected Value o£_D « 

Recently, however, both Cortese, Falk and Cohen (1976) and Winship 

(1977) have described a major drawback of the dissimilarity index. A 

principal use of segregation indices is to compare different aggregates 

of establishments or environments with respect to their relative degree 
N 

of segregation . For example, are blacks and whites more segregated in 
the first two years of college or in the last- two-year s ; are black and 
white managerial-level personnel more segregated from one another into 
different places of work than are black and white machine operators? 

In the calculation of the segregation of a given aggregate , a 
certain degree of segregation may be produced by "chance." That is, 
a random allocatic^ of the available blacks and whites to the establish- 
ments in the aggregate would not produce exactly the same racial 
distribution at each establishment. Clearly, if the establishments are 
quite small,' although blacks might constj.tute only about 10% of a 
given population, even a random distribution might produce units that 
are 50% black. 

Cortese, Falk and Cohen, and later winship, showed that the expected 
value of the dissimilarity index— that is, its value under a condition 
of randomly produced segregation alone--is not only dependent on the 
size of 'the units in the aggregate, but also is highly dependent on the 
group composition (proportion black) of the aggregate, in particular, ' 
the expected value of D is larger is. the aggregate increases in homo- 
geneity, and smaller as the two groups considered become more similar 



in total numbers. Table 1, adapted from Cortese, Falk, and Cohen 
(1976), shows the large magnitude of the dependency of the expected 
value of D on "percent black" and on the sizes of the units comprising 
the universe. 

-For example, industry A with a minority population of 30% and an 
average of 100 workers per establishment would have an expected value 
of its dissimilarity index, E[D], of only .09; on the other hand, 
industry B with only a 5% minority concentration and an average employee 
population of 50 would have an expected value of .26 to its dissimi- 
larity index. I If the actual dissimilarity index for industry A was .25 
and that for industry B was .30, it would seem more appropriate to 
consider industry A the more segregated even thoug^h its dissimilarity 
index was lower. \ 

The Coleman Segregation Index -"' 

There is an alternative measure of segregation to the dissimilarity 

\ 

index — one'tliat has many of the advantages of that index including its 
0 to 1 range, common-sense interpretations,^and its ease of calculation 

--but which is not nearly so subject to misleading results because of 

\ ' 

the "unequal expected value" problem. This index has be . independently 

2 

discovered on a number of occasions, as early as 1947, but most 
recently by Coleman, Kelly, and Moore (1976) in their studies of racial 
segregaition of public schools. Although this index goes by no common 
designation, we refer to it, as the Coleman segregation index because 
of its genesis for our own use. However, where Coleman designated it 
as "r", we designate it as "S" for "segregation." 




S is the proportional imderre presentation of one group In t he 
eavironment of the average member of aliother fsroup . It indicates the 
difference between the actual cross-racial experience of a group and 
the cross-racial experience that would exist under perfect integration 
of the two groups; that is, if all establishments had/the same distri- 
bution of the two groups. Thus, it is a measure of lack of exposure 
given availability. . - 

For example, in 1975, the privately employed l^bor force in the 
U.S. was 10.7% black. However, in the average (non-Hispanic) white 
worker s place of employment, only 8.7% of the workers were black. 
Therefore, the degree of segregation according to the Coleman S index 

w^s: c: _ 10.7 - 8. 7' 

S = — = 19 percent = .19. 

Algebraically, the average percent b^' : 'hites is given by the sum 
of the proportion, of blacks in each > . rn;'ent, weighted by the 
number of whites at that establishment, divided by the total number of 
whites. Thus: 

£^(w^ X p^) wliere = whites, i'th unit 

^blw " TUT » (establishment);' 

^ i = % blacks, i'th unit 

(establishment) . 



The segregation index, then, is: \ 

P - P, , hi - 

\ S = 5^-= 1 - — , where P - overall % black/ 

The S index is not restricted to the case where the two groups considered 
form an exhaustive set of the groups in the universe. That is, ''percent 

♦ 

black' can refer to the proport;ion of blacks among all people in the 

/ 

particular unit, including those who are neither white nor black. 



ERIC 42 



/ 



-8- 



ERIC 



However, in such situations, the S index between two non-exhaustive 
groups--for example, between Hispanics and blacks — will differ depending 
oh whether it is calculated based on the percent "X" for the average -- - 
••Y" or the percent "Y" for the average "X." However, the two S values 
in this case are usually reasonably close. For example, the employment 
segregation between black workers and Hispanic workers is either ,05 or 
,06 depending on whether it is calculated from "percent black for 
Hispanics"^r^from "percent Hispanics for blacks," 

In contrast to the dissimilarity index, there is no comparable 
graphical interpretation to the S Index, However, when the two groups 
considered form exhaustive categories (blacks and non-blacks, for example), 
S has several interesting properties. To begin with, S equals the rebetween- 
establishments^' proportion of the variance in the dichotomous individual- 
level variable, race, mftasured over alL persons in the aggregate; 

That is: o o 

< 2 ^,^2 where 
z. n - NP 

« ^ between SS ^ 1 i*^i N = # establishments; 

• . total SS NP (1 - P) ' P = overall % black; 

. ^^i'^i" ^^"^spondlng values 

^ ,of the i establishments. 

This relationship is derived in Appendix 1, ^ 

-Secondly, when the two groups are exhaustive, there is an additional 

interpretation of S. Namely, for blacks and non-blacks, for instance. 

It is the difference between the average percent black for blacks and V 

the average percent Ijlack for non-blacks. In other words, if the 

average racial environment for blacks is 407o black and the average for 

hon«*blacks in the same establistuoents *i!s 10% black, then S = ,40 - .10 

« .90. The derivation of this^ relal:iond|ip is shown in Appendix 2, • . 
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Thus, the Coleman Segregation Index has three interpretations: 
it 'is the proportionil underrepresentation of one grouj.'* in the environ- 
ment of the other in comparison to a situation of perfect integration; 
and when dealing with exhaustive categories, it is both the "between- 
establishments" proportion of the variance in the dichotomous variable 
distinguishing individuals in the two groups, and it is the absolute / 
difference ^bfetween the within-group environment for one group and the 
cross-group environment for the other, expressed as proportions. 

/■■ • ■ ■. ■■ ■ . ■ 

Definitions of "Equal" Segregatidn and the Dependency on P in Calculating 
Index Scores ' * . 

Taeuber and Taeuber (1965) have objected to indices similar to the 
Coleman segregation index because the group (racial) composition of the 

universe (P) is explicitly used in the derivation of index scores. In 

. ' • ' ( .... 

contrast, the dissimilarity measure is calculated independently of P., 
They suggest that the dependency on i> in the calculation of an index 
makes it possible for different index yalues to be associated with 
situations that are "equally Segregated" but which vary in P^. 

It is true that two universes with identical Lorenz curves but 

' ■ ^ ■ . . • ■ v> 

different values of P would have Idlentical'dissimilarity scores while 
the jEToleman S scores for these two universes would not be the same. " 
(The closer P becomes to 50%, the higher the S value would be.) 
^ However, this point merely moves the question of what constitutes 
"equal segregation" back a step. The Lorenz cnxy^^iit inay be recalled, 
is based on the distribution within each racial, categdry across units 
of analysis. For example, .consider two hypotheticat'"cities" with a 



total of three "census tracts" eac^ CTable 2). In both cities, one- 
tenth of air whites live in one t^ct along with haJLf of all blacks. 
In a second^ tract, 40% of all whites and 40% of all blacks reside; 
and in the third tract, the remaining whites (half of all whites) and 
the . remaining blacks (10% of all blacks) live. . 

In this, situation, regardless of the relative numbers of whites 
and blacks, the dissimilarity index would be the same (D=.40). In 
contrast, the S score would vary depending on the yalue of P. S would 

equal .37 where P=.50V for example, and would equal .22 where P=.09 

(or where P«.91). However, although from the perspective of the 
distribution within racial categories across tracts these two Situations 
are equiv^alent, it is not clear that one would normally consider'the 
situations to be "equally segregated.^' / 

Table 2 jshows the breakdown of blacks and whites by tracts in 
two hypothetical universes^ In the first, where P=.09, the three tracts 
have racial compositions of 2%, 8%, and 50% black. In the universe 
where P=.50, the three tracts are 9%, 50%, and 91% black. Variations 
in the racial composition are clearly larger on ah absolute scale in 
the latter universe, and the between-tract proportion of the total 
variability of "race" is larger in the latter situation as well, as 
shown by the larger S value for that universe. From this perspective, 
then, the two situations are not equivalent and one would not want a 
.segregation index to produce equal values for them. yj 

Thus, the invariance of the two indices of segregation, D and S, 

>> . ■ J . 

across situations of "equal degrees of segregation" depends totally 
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on the definition of equal segregation that is used. It Is not clear 

that the definition of equal Lorenz-curve inequality is necessarily 

superior to one based on explicit variability in group composition 

3 

(e.g., percent black) of the different units. In any event, the mere 

^- / ' ' ■ 

utilization of P in the calculation of the segregation index does not 

invalidate its utility or the appropriateness of the definition on 

which it is based. 

Perhaps Instead, the indices should be compared on other grounds. 

Expected Value of S 

We saw earlier that a major problem with the dissimilarity ir-d-:>;. 
was that its expected value differed markedly according to the number 
of people in each unit and according to the relative overall hetero-, 
-geheity in the universe. How, then, does the expected value of S vary 

as a function of these two parameters? 

"IiT Appendix 3, we derive an. approximation^ for the expected value 
of S, E[S] « k/N, ' where k is the number of units ('establishments) 
in the universe and N is the total-number of persons in the universe. 
Thls^^/rfolutlon applies td universes whose , units vary in size; but in 
addition, where all units are of equal size n, n^= N/k,^/and so EtS] ' 



simplifies to. 1/n, Thtiis, where schools in a district a^e all size 500, 
or where they average size "500, EtS] = .002. . ""v^ 

. • Two conclusions are apparent. ^ First, the expecti^d value of S, in 
contrast to that for the 'dissimilarity index, is not -/a function of P, 
the proportion of the universe population in group X. Secondlyi the 
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expected value is much smaller relative to the range of possible values 
than is the case for EtD] and rapidly approaches zero for even moderateily- 
sized average unit sizes. For example, where n = 25, E[S] .04, 
whereas EtD] ranges between .16 and .79 (for .Ol£p£,99). For n = 100, 
EtS] = ,01; E[D] varies between .08 and .37. 

Empirical examples 

Although the dissimilarity index and the segregation index are 
derived from different definitibns of segregation, they are measures of 
roughly similar concepts. Given appropriate conditions, their correla- 
tion across different aggrega{:es should be substantial. For example, 
Zoloth (1976) found that the respective measures of between- school 
racial segregation of students correlated .87 across 2,393 school 
districts with at least, a 57o minority student population. 

Hov/ever, there are situations where D and S have been found to be 



" 'negatiWly" 1correraFed with one another. For example, in the same study 
-referred to abovA, Zoloth found that the two indices of segregation 

used to measure between- school faculty racial segregation correlated 

ft ' . 

-.24 across these 2,393 aggregates. . 

The difference between these two results, is probably due to two 
factors: .;the significantly smaller size of the faculty populations 
at each school in comparison to the student poppl^tToh^^^ and the fact 
that the school districts^ included in the study were subject tJD a 
restriction that at least 5% of their student populations be from 
minority groups while there was no coi^arable restriction regardiixg the 
faculty ethnic composition. Thus t^e district-wide faculty varied 
over a wider range of "percent minority." 
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In Table 3, we present data from/arpther situation where the 
dissimilarity index and the Colemar^/s /index are not highly correlated. 
The data are from a study ^of . rap l/l segregation across places of 
employment for worke^ in the s^ne general occupational category such 
as manager, clerical yotker, ^pr operative. The study covers a represen- 
tative sample of 7,483 sepa^rate establishments in the private sector, , 
each of which employs at l^^ast 25 workers (Becker^ 1978) • 

w " / ■ .. ■ - 

Measures of racial segregation fcjr the nine Separate occupational 

■' \ \. ' ^ ■■/. ' ^ / ■ ■ 

categories are shown in the table. The dissimilarity index scores vary 
ii; the narrow range fi^om .48 to .64.- The Coleman S statistic varies 
between , 14 .and •40.y^ (The fact that the S scores are nearly always 
lower' is not ^ignif/cant because S is b^iised on proportions of ^ums-of- 
squares, while. D i^t based on proportions , of cases.) 

Howe vef,"^ across these nine uhiverses (o(^cupational categoric ^)^, 
the 'Overall corr^^lation between D and S is only +.26. The two, cate- 
gories with the lowest segregation measure by the Coleman S standard — 
managers and professionals— have two of tte highest dissimilarity index 
scores. These/ are also the. two occupational categories with the lowest 
percent biackv As we indicated eariier, where there is a small propor- 
tion in the V^itnority" group, the expected value of the dissimilarity 

index is parjticularly large (see Table 1) . Thus, the expected vdlue 

•\ ' " ■■ * ■ » 

of D for professional workers is approximately .45 while the expected 

value of D for laborers, for example, is only about .15.' If one were 



po subtract- the expected value of D from the victual scores (or alter- 
nately use; a 2-score transformation approximation suggested by Cortese^ 
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Falk and Cohen), the resulting ordering of segregation by occupation^ 
category would be similar to but not identical with the ordering obtained 
from the'' Coljsman statistic' ^ 

Applyii^g a correction factor for the expected value to the Coleman 
S index ^.e. S' = S - k/N) does not change the relative ordering of 
the, occupational categories. . However, it does increase the distance 
between them: managers and professionals become even, more distinctively 
the catego^ries with the least racial segregation and segregation among 
operatives becomes more similar to the other lesser-skilled blue collar 
categories which have the highest degrees of segregation — laborers and 
service workers,,^ 

It is instructive to note that \the ^C^leman index values are highly 
correlated with the percent black in the occupational category--^ the 

■/■■■•- -■ ■ .... ■ 

greater the proportion of blacks in a category, the higher- the segrega- 
t ion inde^ value. We also found thiis relationship to be true in loeasuring 
the racial segregation of students across institutions of higher 'educa- 
tion. When institutions are clustered by estate, the higher the -black 
proportion of the college population in a given state, the higher the 
racial segregatioi;i among institutions within that state. (See Table h\ 
see also, Thomas, et al., 1976.), , j 

We have seen, though; that tlie expected value of the Coleman* 
segregation index is independent o^ the racial composition of the universe 
and so tnese relationships must be produced by forces jpther than chance. 



It would be interesting to speculate about the causes of this associa 
tion between percent black and the segregation index S, but we leave 
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that for another paper. Here we only wish to aoserve that ^ the index 
that appears £o be contaminated by the relati\^-- group proportions in 
the population is in fact, free of such bias (i-^., the Segregation 
Index), while the index which appears empirically to be less correlated 
with the "percent black" factor and which is calculated independently 
of it (the Dissimilarity Index), is in fact the one whose expected value 
is strongly affected by this variable. 

Decomposition of Segregation Indices--Controlling on Other Variables 
One important property of a segregation index, as we have seen, 
is the resistance of its expected value to irrelevant variations among 
the^gregates whose, internal segregation is ^'^sing compared (such as 
averai^ unit size and overall group heterogeiwity) , A second important 
property of a segregation index is its amenmaility to the, decomposition 
of segregation into that portion^that is between sub-classes of units 
within the aggregate and that portion that ramains after controlling 
on the variable forming these sub-classes. 

For example, how much of the observed black-white residential 
^segregation is attributable to income differences between the races and 
how much to segregation Within income- classes? Or, how much of the 
racial segresgat ion in employment is due to unequal distribution of 
black aoEt-idxtte workers across the various occupational categories and 
how much is due to place-of-work segregation within occupational 
categories? ' * / 

Wiwborough (1974) showed how the dissimilarity index can be 
decominwd Into '-^ithin" arid "between" compDnents. However, in this 
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decomposition there is a third component that cannot be allocated to 
either 'V;i thin-compositional-category" segregation or "between- composi- 
tional-category" segregation. He describes it as "the differences in 
composition evaluated over the differences in composit:ion- specific 
distributions" (p. 3). 

Coleman's segregation index can also be decomposed when the two 
groups under consideration form exhaustive categories. Decomposition 
of the segregation index can be seen as an application of mult i- level 
hierarchical analysis of variance, in the two- level version of this 
model, we have, for example, students within schools, schools within 
state^, and states within a single region. The total sums of Squares 
(crqual to N*?*(l-P)) can be divided into portions representing variations 
within schools, between schools of the same state, and between states 
within the region. The proportion of the total sums of squares that is 
"between-schools/within-states" can be interpreted as the degree of 
segregation due to the varying grou^ composition of the sSRools within 
the same state. The proportioit of^the total sums of squarias that is 
"between- it a teg" indicates the segregation due to variation in the group 
composition across states. TaJale 5 presents the formulas from the two- 
level hierarchical ANOVA model applied to the measuremettt of segregation. 

Applying -thi« method to the^^data on racial segregation of workers 
by place? of empll&yxti^pat and by occupational category (Table 3), we find 

the foil wing results: 

i - I, . , 

' ■> ■ . ■ ' . ' 



V /, 



I 

I. 

I- 



A. Racial segregation by place of work for workers 

of all occupation categories combined, .190 (from Table 3) 

B. Racial segre^tion into different occupational 
categories and different places of work... .301 

(1) Segregation between occupational 

categories .047 

(2) ^Segregation between, places of work 

within occupational categories .254 

In this example, we are using each worker's occupational category 
as the variable to split up the universe of establishments into different 
"between-" and "within-" groups. Although this seems like a different 

situation than the exantt>le of sftudents-within»-schools schools-within- 

V ■ ' . 7 ■ ' 

states, states-within-a -region,, the principle is identical. 

Result "A" is reprinted from line 1 of Table 3; it refers to segre- 
gation measured when all workers at each place of employment are 

combined. In result B, we measure the segregation) of workers both by 
place of employment and by occupationial category. The level of segrega- 
tion in result B is higher than •'■-hat for result A. Part of this increase 
is du«?. to the different racial proportions in the various occupational 
categories. Thus, result B(l) indicates that there is some racial 
segregation in employment across occupational categories. This is 
the segregation we are most aware of: i.e., 'blacks concentrated in 
service and blue collar catiagories. However, by far the greatest 
segregation in employmeht is across the various places of business 
vithin each occupational category (result ^i^) • This measure of 
segregation is even higher than that given in A; that is, the racial 
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segregation among workers in particular occupational cat;egorles Is 
higher than the overall racial segregation of workers as a whole. 
Discussion and Conclusion 

In this paper we have examined some of the characteristics of two 
statistical measures of thje concept "segregation" as It has been applied 
to aggregates of collective units such as ^school districts containing 
schools, cities containing census tracts, and labor markets containing 
places of employment. '.^^ 

One of the two measures, the Index of dissimilarity, or D, has 

been* more widely used ..ilT^ocial scientific analysis of isegregation than 
any other index. This index, however, has been previously shown to 
possess a major disadvantage for the comparative study of segregation 
acrcfss different aggregaLes"namely that its expected value varies 
according to the size of the units in the aggregate and according to j 
the group (racial) composition in the aggregate as a whole. In com- 
parlsons where aggregates vary sharply in racial composition and where 
the units tend to be small--for example, measuring segregation of .. 
different categories of workers across places of emplGyment--this 
characteristic of the dissimilarity index can generate qbite misleading 
result's concern "^ng the relative segregation of different aggregates. 
Of course', where these conditions do not apply — for example, when 
comparing the segregation of cities with similar racial composition 
using large census tracts as units of analysis--this attribute poses 
much^ less of a. problem. However, other problems with the dissimilarity 
index — for example, its insensitivity to differences in segregation 
among the. '''over-represented*' unitis or among those "under-represented" 
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by a group under dlscusslon--may stll 1 proveht this index of dorlvotlons 
of It from being the index of choice (Winahlp, 1978). 

In. contrast, the alternative segregation index that we have dis- 
cussed in this paper--the index referred to as the Coleman Segregation . 
Index, or — seems to have few of these drawbacks. 

To summarize, S is the weighted average of cross-racial experience 
relative to the racial composition of the aggregate as a whole. It has 
several other verbal interpretations: 

(1) S is the proportional underrepresentation of one group in the 
environment the average member of another group; 

(2) S,is the between-establishments proportion of the variance 
in the dichotomous individual- level variable, race, measured 
over all persons in the aggregate; and v 

(3) When two groups are exhaustive, it is the difference between 
the average within-race experience of one group and the average 
cross-race experience of the other group. 

t In contrast to the often-substantial expected value of^^the dis- 
similarity index'when people are randomly assigned to different units, 
the expected value of S approaches zero under nearly all circumstances. 
Also, it is independent 6f /the racial proportions in the aggregate, 
whereas EtD] is notl 

The fact that E[D] ^and e(S] behave so differently can account for 
a number of instances w^iere D and S produce values that are either 
negatively correlated or only slightly positively relateci. Following 
up one of these instalncesi it was suggested that the observed association 
of S with the prpjpqrtion black in an aggregate 'may be indicative of a 



relationship of sv'^stantive importance--namely, that segregation is 
greater in universes with larger b'lack proportions even when the measure- 
ment is standardized for racial composition. ^ / 

Finally, it was shown that S can be decomposed into segregation 
between sub-classes of establishments in an aggregate and segregation 
between establishtaents within each sub-class. Racial, segregation in 
employment provided one empirical example: segregation of workers of 
the same occupational category into different places of work is of much 
greater magnitude than the oft en- observed racial segregation that exists 
between occupational categories. y 

Thus, although the S index has not had as wide a usage as has the 

dissimilarity index, it does appear^to be comparable or superior to the 

■ . ■ ' ■ , - ' 

V 

dissimilarity index in its interpretability , its ease of measurement , 

' ■ . <L 

it^ decoraposability, and especially, in its robustness for Ihter-aggregate 
comparisons. Future research in this area should consider the segregation 
index used recently by Coleman, Kelly, and Moore, especially when 
comparing the relative segregation of aggregates that vary greatly in 
their overall group^ composition and where sizes of units under study are 
^^^te small. , " . *. 
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Notes 

Taeuber and Taeuber (1976), commenting on the Cortese, Falk and 
Cohen paper, pointed out that much of the reason for the high 
expected value of the dissimilarity measure under conditions of low 
percent minority has to do with the minimum value that the dlsslml«* 
larlty Index can reach given the few persons that must be 
spread around the large number of units. Under these conditions, 
perfect integration is not possible. However, this fact illustrates 
the difficulty of using the index to compare the relatfive segregation 
of two different universes wHose initial conditions vary so greatly. 
What is necessary for comparisons is an index whose expected values 
are invariant over such situations. See also Massey (1978) and 
Cortese, Falk, and Cohen (1978). 1 

Duncan and Duncan (1^55) refer to several discoveries Including 
its discussion in Jahn, et al (1947) and Bell (1954). They refer 
to the index as the "eta" index while Taeuber and Taeuber (1965) 
discuss it as the Bell index. 

Winship (1977), comparing D with two other measures — D - E[D] 
and an index Identical to S — suggests that the comparison point of 
randomly produced segregation should* be used when examining segre- 
gation as a dependent variable, but that study of tt^e "effects of 
segregation" should employ D since "if makes little difference 
whether segregation is random or non-random." This is generally 
reaspi^ble, except that the ieffects for individuals of the experience 



of racial Isolation (Winship's example), are perhaps better measured 
by the individual's own specific racial experiences, and for the 
group as a ^ole, by "percent" X for the average Y. " V 
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The need for the approximation symbol derives from the fact that in 
the theoretical model one is sampling units without replacement 
whereas the calculation of the approximate expected value assumes * 
independent samples with replacement. Falk, Cortese, and Cohen (1978) 
Jiave, apparently found what they feel to be major discrepancies between 
the hypergeometric model (sampling without replacement)^ and the 

-binomial model (sampling with replacement) for their analysis of 
E[D], particularly when the n^ varry in magnitude. On the basis of 

. a reading of their published results, however, the discrepancy 
seems not to be sizeable; in any event, it is not clear that 
the same problem exif r.s regarding the- S index. 

Another correction procedure, the use of a standard score 

S - E [ S 1 ' 
S' = ' — 5 ^ — * would be superior to S. This is the 

cr^[s] ' 

procedure employed by Cortese, Balk, and Cohen (1976) to try to save 

the dissimilarity measure^ Unfortunately, we have^ot obtained a 

2 

solution to the value (J [S] at this point, althbugh Monte Carlo 
methods suggest themselves as a first step. 

In contrast to the calculations for Table 3, where the denominator 

includes blacks^ non-Hispanic whites and all others, these calculations 

consider only, black and non- Hispanic white workers to preserve the 

exhaustive dichotomy required for the ANOVA interpretation. 

In particular, we hkve workers of a given occupation (cf , students) 

working among others of the same -occupational category at a place 

of work (cf, schools)^ which is one of the universe of such "work 
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groups" of similarly employed workers (cf, states), which, when 
combined with the other ^*work groups" of the other occupational 
categories, produces the entire labor market (aggregate) under 
study (cf, region). 
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Figure 1: Lorenz Curve and The Dissimilarity Index 




Cumulative proportion of all non-blacks in the aggregate 
Units in aggregate ordered by black" in unit. 
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Table 1: Expected Value of the Dissimilarity Index 
under Random Distribution of ••Majority" and "Minority" Populations 
across the Units in the Universe (exhaustive groups) 



Expected Value of D: 



Where the "minority's" 
proportion in the 
universe is . . . 


And the 
'people 
10 


number of "minority 
in each unit in the 
25 50 


" plus "majority" 
universe is . . .• 
100 1000 


.01 


.91 


.79 


.61 


.37' 


.13 


.02 


- .83 


.62 


.37 


.23 


.09 


- .05 


.63 


.37 


.26 


. 18 


.04 


.10 


.39 


.27 


.18 


.13 


.04 


.20 


.30 


.20 


.14 


.10 


.03 


.30 


.27 


.18 


.12 


.09 


.03 


.50 


.25 


.16 


.11 


.08 


.02 



Adapted from: Charles F. Cortese, R. Frank Falk, and Jack K. Cohen, 
"Further Considerations on tlie tfethodological Analysis 
of Segregation Indices," American Sociological Review . 
41 (August, 1976): 630-637. 



ERIC 



- 32 ^ 



28 



Table 2: Values of S and-D under Two 
Hypothetical Situations 



Situation I: Percent black = 50% 





Perceat 


of all 


. Number of 


Percent 






Blacks 


Whites 


Blacks 


Whites 


^lack 


Segregation Indexes 


Tract A 


50% 


10% . 


. 250 


25 


91% 


D = .40 


Tract B 


40% 


40% 


225 


225 


50% 


S = .37 


Tract C 


10% 


50% 


"25 


250 


9% 


Percent black for 
average white = 31.6% 


Total, 
City I 

' - - , 


100% 


100% 


500 


500 


50% 





Situation JI: Percent black « 9% 



1 


Percent 
Blacks 


of all 
Whites 


Number of 
Blacks Whites 


Percent 
Black 


Tract D 


50% 


10% 


50 


50 


50% 


Tract E 


.40% 


40% 


40 


450 


•8% 


Tract F 


10% 


50% 


10 


500 


"2%. 


Total, 
City II 


100% 


100% 


100 


1000 


9% ■ 



Segregation Indexes 

D = .4 

S = .22 

Percent black for 
average white = 7.1% 
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TablQ 3: Segregation of Non-Hispanic Whites and Blacks 
across Places of Employment in the Private Sector, 
by Occupational Category 



ColemanT 
Segregation 
Index 



Dissimilarity 
Index 



Percent 
Black 



Mean Number Workers 
in this Category 
per .Establishment* 



All employees 


.19 


.48 


10.77, 


201 


Managers 


.15 


.64 


3.1% 


22 


Professionals 


.14 


.59 


3.0 . 


33 


Technical Workers 


.20 


.57, 


6.5 


25 


Sale's Worke;rs 


.21 


.56 


5.6 


31 


Clerical Workers 


.20 


. .48 


9.7 


37 


Craft Workers 


.16 


.49 


7.1 


45 


Operatives ' ^ — 


' .23 


.51 


14.3 


69 


Laborers 


.40 


.62 


21.0 


41 


Service Workers 


' .38 


. .58 


22.5 


32 



among establishments with one or more workers in this occupational 
category. 

: ' : : V- 

Source: Equal Employment Opportunities Commission. 1975. Annual Survey 
of> Private Employers. 



Table 4: Segregation. of Whites 
in 4'- Year Colleges by 



and Blacks (others excluded) 
State, Southern U.S. Region 



Coleman ^ 
Segregation Percent 
Index Black 



Sout'ViPTTi ^t'at'ric 


.60 


17.3 


Alabama 


.65 


24.9 


Arkansas 


.39 


17.5 


District of Columbia 


- 77 


43.5 


Delaware « 


.45 


11.8 


Florida 


.47 


12.6 


Georgia 


.04 


22.8 


Kentucky ^ 




7.6 


Louisiana 


. D / 


25.9 


Maryland 




19.4 


Mississippi 


.69 


33.4 


j.North Carolina 


.74 


21.2 


Oklahoma 


.25 


6.5 


South Carolina 


.62 


21.6 


Tennessee 


.49 


14.6. 


Texas 


.51 " 


11.5 


Virginia 


.75 


16.8 


West Virginia 


.09 


5.5 



Source: Office of Civil Rights, DHEW. 1976. Enrollments in 
Higher Education. 



Table 5: Decomposition of the Coleman Segregation Index by Application 
of Hierarchical Analysis of Variance (2- level ANOVA example) 



Soufce of Variation 

General . Example °^ Squares X^^j^ = [0, 1] Segregation Measure 



^ Between 
Sub-aggregates 


between ' 
states 


_ "me an- c o rre c ted 
between": 








^2 2 
N p . NP 

i "i i . 










N 




NP(1 - P) 




Between units 

within 
Sub-aggregates 


between 
schools 
within 
states 


/" 

.^^ between schools - 
between /states: 




I)- 


1 ; 


/ -f* n,,.p. . *r 
1 i -1 irn - 1 

/ ■■" NP( 1 - P) 


z 

N P 
i^i 


Between 
Individtials 
within units 


uithin 
schools 


^ total - between 
schools: 


i j k ijk 


ij 






Total 


Total 


i j ? ^ijk 


N 




• 






Where N = 1 £ n 
1 i j ij 


. h 


/ 

= -^n., X. 

/ ■ 


j k 


ijk 




^X' 
k *ijk 




36 
















37 
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Appendix 1: S as Between-Establishments Proportion 
' of Sums-of-Squares . 



We have that S = 1 - ly^d - P^) = i . £°iPi - ^^^Pj^^ 

P(£n^(l - P^) P(Cn^(l - p.) 



But £n^Pi = NP and ^n^(l - p^) = n(1 - P) 

So 
S 



^ _ NP - £n^Pi^ 



P ♦ N«(l - E) 

- P) - (NP - 
N • P (1 - P) 



NP(1 - P) - (NP -^n^P^^) 



S = 



S = 



NP 



- NP^ - NP +1\P^' 



NP (1 - P) 

2 ' 2 

In^p^ - NP^ 

, NP (1 - P) 



From the analysis of variance of X, . = -{ I ^^ack C \ 

ij t 0 iff non-black ) ^ 

^ 2 
Sum of square^ due to qiean = C = i j iy = NP 

" N 



Between-establishment X' ^^ <* 

• Sum Squares = ^ ^ " ^ " i "i^i " 

<-"i 



2 



Total sum of squares = ^N^^^j^ - C = NP - NP^ = NP(1 - P) 

2 

(since x^^ = x^^ for x = [0, 1]) 

X! 2 2 
betveen SS ^ i ^'i^i " 

total SS " N P (1 - P) ° (q.e.d.) 
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Appendix 2: S as Difference .between P^j^ and P^jj, 

The Within- and Cross-Group Environments 



<lj - 1 - Pj^> the proportion of non-blacks in unit i, and Q = 1 - p ) 



P.U=lVi =iWi and P,. "=^Vi =^Vi^i 



Then k i = - i i = and P„,- 
BJB B|B — 



1 



1 1 



Q(£n,p, ) - Pln,p,(l - p.) 
NPQ 



Q(£n p ) - P(ln p )'+ P(£n.p^) 

^ ^ -L i.' ^ 

■ NPO ' ' V 



P + Q (Ln.P, ) - Pdn.p.) 

1* (£n,p,^) - p.NP 
NPQ 



y 9 

^ ^Tn^p, - NP 
NP(1 P) 

^ between-establlshment SS 

total sum-of- squares " ^ (q.e.d.) 
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Appendix 3: .E[S]; The Expected Value of S, Equals -, 
The Average Number of Persons per Unit or Establishment. 
(Using a Binomial Approximation— Sampling "with Replacement- 
to the True Hypergeometric J)istribution) 



E[S] 



= E 



r 



NP 



k 



1 - 



i=i Vi 



NP (1 - P) 



= 1 - 



NP(1 - P) 
1 



fNP ~ E [5:n.p.2]} 



^ ""(nP - lEEii ] . E[p/] .. (Since n, and p. are 

NP(1 - P) ^ 1 i. '^i 

independently distributed.) 



Also, var p. = lO-I^ = E[p. - P]^ = E[p2j . 2.p.E[p.] + / = E[p2] - p^ 



• Pfn^l - P(l -P) , J> 



Then E[S] = 1 - 



= 1 - 



NP( 



^ i=i 1 
I _ p) (np - k P(l - p) - NP^) 



^(1 - P) - NPq - P) + k P(l - P) 
NP.(1 - P) 



N 



40 



